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I. INTRODUCTION 


Faced with a complex and dynamic panorama and, 
therefore, adaptations that cross almost all institutional 
environments, the apparent success of the planning activity 
needs to contemplate the use of useful and available 
information, combining the application of techniques that 
contain a quantitative approach and aided by conceptual 
structures to enable more qualified interpretations. Provost 
and Fawcett (2013b) state that understanding this process 
and its stages makes it more systematic and, consequently, 
less subject to errors and inconsistencies. In this sense, two 
techniques for effectively analyzing the data are explained: 
principal component analysis (PCA) and cluster analysis. 


Principal Component Analysis (PCA) is used when 
large multivariate data sets are analyzed, as it allows for 
reducing the dimensionality (number of components or 
dimensions) of a data set. With it, it is possible to replace a 
set of original variables p with a smaller number q. This 
reduction is made by searching for linear combinations, 
ay'X, a2'X,..., aqx called principal components, which have 
a maximum variance for the data, and are not correlated 
with previous combinations акхѕ. When solving this 
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Abstract— The present study provides an overview of energy production 
in Brazil and in other 53 (fifty-three) countries, considering 18 (eighteen) 
variables of interest collected from the World Bank’s database of years 
1997, 2002, 2007 and 2012. With an essentially quantitative approach and 
used Principal Components Analysis (PCA), 10 (ten) variables were 
adopted that most contributed to the variance of the data. In parallel, 
through the clustering technique, the countries were grouped by common 
characteristics, observing that Brazil, even with privileged water 
resources, has walked in the opposite direction of the other countries, 
presenting an involution in some reference indicators. The main 
contribution of the study is the design of an analysis model capable of 
defining, preliminarily, a design of the Brazilian energy matrix from the 
end of the 90s to the year 2012, a period that includes the publication of 


the secondary data used. 


maximization problem, one finds vectors ai, a», ..., aq that 
are the eigenvectors of the S data covariance matrix, which 
correspond to the q largest eigenvalues. These eigenvalues 
provide the variance of their respective principal 
components (Jolliffe, 2005). The relationship between the 
sum of the first q eigenvalues and the sum of the variances 
of all the p original variables represent the proportion of 
the total variance in the original set of data, calculated by 
the first q principal components (Jolliffe, 2005). 


Kaufman and Rousseuw (2005) define cluster analysis 
as the art of finding clusters in data. And Jain (2010) as a 
multivariate technique to evaluate natural clusters of a set 
of patterns, points, or objects. From an operational point of 
view, clustering is defined as a representation of n objects 
to find K clusters based on a measure of similarity such 
that the similarities between objects of the same cluster are 
high, while the similarities between different objects are 
low. Jain (2010) presents 3 (three) main objectives in the 
use of data cluster analysis: (1) underlying structure, that is, 
to obtain an overview of the data, formulate hypotheses, 
detect anomalies, and identify prominent characteristics; 
(i) natural classification by identifying the degree of 
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similarity between forms or organisms (phylogenetic 
relationship); and (iii) compression by organizing the data 
in order to summarize them through cluster prototypes. 


Sun et al. (2017) and Jain (2010) claim that clustering 
plays an important role in business contexts, being widely 
used in market segmentation, product categorization, the 
analysis of customer profiles, behavioral patterns, and 
performance levels workflow types, and business models. 


According to Ketchen Jr and Shook (1996), cluster 
analysis takes a sample of elements as an organization and 
groups them so that the statistical variance between the 
elements is minimized while the variance between the 
clusters is maximized. Studies of this nature were initially 
addressed in industrial and organizational economics 
literature with groups defined through sets with few 
variables. 


This methodology is prevalent in disciplines involving 
multivariate data analysis and is widely used in the 
managerial context, as it allows summarizing a large 
amount of data from a simplified visualization of 
interpretation. Although the K-means algorithm is robust 
to handle relatively large volumes of data and fast requires 
that the number of clusters is specified and sensitive to 
outliers’ presence (Lopes & Gosling, 2021). 


П. THE DESCRIPTION AND DISCUSSIONS THE 
STUDY 


Using both analysis techniques (PCA and Clustering) 
presented and having as reference the model proposed by 
da Silva et al (2022) and moreover searching to determine 
a representation for the Brazilian electrical matrix. The 
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first part of propose of application to consist to used 18 
variables with historical series from 1960 to 2020 
preliminarily selected of 266 countries of World Bank's 
(2020) database, which were: ANE - Alternative and 
nuclear energy (% of total energy use); APD - Average 
precipitation in depth (mm per year); ATE - Access to 
electricity (% of population); CPI - Consumer price index 
(2010 = 100); EIN - Energy imports, net (% of energy 
use); EPC - Electric power consumption (kWh per capita); 
EPFNS - Electricity production from nuclear sources (96 of 
total); EPFOS - Electricity production from oil sources (76 
of total); EPFRS - Electricity production from renewable 
sources, excluding hydroelectric (kWh); EPOGCS - 
Electricity production from oil, gas, and coal sources (% of 
total; EPTDL - Electric power transmission and 
distribution losses (96 of output); EU - Energy use (kg of 
oil equivalent per capita); FFEC - Fossil fuel energy 
consumption (96 of total); ICP - Inflation, consumer prices 
(annual 96); IEWPP - Investment in energy with private 
participation (current US$); MEES - Methane emissions in 
energy sector (thousand metric tons of CO» equivalent); 
REC - Renewable energy consumption (% of total final 
energy consumption); RIFRT - Renewable internal 
freshwater resources, total (billion cubic meters). 


All statistical analyses and procedures were 
performed in the computer environment R in version 4.0.5 
with the aid of the packages “tidyverse” (Wickham et al., 
2019), “ҒасіоМіпег”, and "factoextra" (Kassambara & 
Mundt, 2020). Initially, Table 1 presents the descriptive 
statistics values (average, standard deviation, the minimum 
and maximum values, and the percentage of unavailable 
data (NA) for the sample. 


Table 1: Descriptive statistics of the selected variables 


N° Code Variable Average Standard Minimum Maximum % of NA 
Deviation 
Е Year З 1960 2020 ш 
L|. ANE О а кы кы 9.93 0.00 71.54 54.66 
of total energy use) 
2 APD Average precipitation indepth о ооо 811.00 18.10 3240.00 84.95 
(mm per year) 
Access to electricit f 
3 ATE erase be elect M 99.50 29.46 0.53 100.00 57.68 
population) 
m 2010 = 
4 CPI си Ro E со 68.50 310.14 0.00 20422.89 38.12 
Energy imports, net (% of energy 
5 EIN E -73.14 555.55 -17632.77 | 100.00 5447 
EP ton (k 
6 ЕРС л имаат 576 54799.17 54.76 
per capita) 


www.ijaers.com 


Page | 18 


Silva and Dias Junior. 


International Journal of Advanced Engineering Research and Science, 9(12)-2022 


Standard 


N? Code Variable Average nM Minimum Maximum % of NA 
Deviation 
7 EPENS Ene rode пыш 4.63 12.64 0.00 87.99 54.71 
nuclear sources (% of total) 
8 крк | О productionem oil у 3024 0.00 100.00 53.90 
gas, and coal sources (96 of total) 
9 EPFRS Electricity production from oil 4.88E+0 A4 A8E410 0.00 1.65E+12 53.81 
sources (% of total) 9 
EPOGC Electricity production deu 
10 S renewable sources, excluding 59.67 33.91 0.00 100.00 53.90 
hydroelectric (KWh) 
ji^ erme | СЕРЕ шыша i xe 9.03 0.00 88.02 55.18 
distribution losses (% of output) 
12 EU Еос л 9.55 40710.11 53.48 
per capita) 
Fossil fuel ti 
13 | FFEC озы шю energy consumption | 66.59 30.47 0.00 100.00 54.97 
(% of total) 
Inflati i 1 
14 ICP ишо pui И 332.40 -18.11 23773.13 38.09 
‘0 
15 | tewpp | Yesbmentinenergy wath private | LOSE) | 5.706409 | 5.00B+05 | 3.45E+10 | 92.63 
participation (current US$) 9 
Methane emissions in energy 3187680.0 
16 MEES sector (thousand metric tons of 24285.00 170798.80 0.00 0 ` 23.46 
CO2 equivalent) 
Renewable energy consumption 
17 REC (96 of total final energy 30.55 30.71 0.00 98.34 54.19 
consumption) 
Renewable internal freshwater 
18 RIFRT resources, total (billion cubic 530.60 3183.50 0.00 43392.16 84.72 
meters) 


The values obtained from the calculations for all 
available years are presented. It is observed that data are 
not available for all countries and years considered in the 
universe of the selected variables as observed in the 
column % of NA. Therefore, to perform the cluster 
analysis, it was necessary to exclude the not available 
(NA) values from the database the values, keeping in the 
database only the occurrences of countries with data 
available in all variables. Of the total, 111 occurrences 
were obtained. Table 2 presents the descriptive statistics 
for the new database. 
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Table 2: Descriptive statistics after excluding NA values 


N° Variable Average Min Max 
- Year - 1992 2012 
1 ANE 5.924 0.000 54.435 
2 APD 1335.9 18.100 3240.0 
3 ATE 82.450 6.197 100.000 
4 CPI 79.37282 0.05003 129.47112 
5 EIN -9.577 -332.610 96.723 
6 EPC 1464.26 50.33 6617.13 
7 EPFNS 0.000 0.000 47.155 
8 EPFOS 5.299 0.000 95.879 
9 EPFRS 4.072е+09 0.000 1.325e+11 
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N° Variable Average Min Max 
10 | EPOGCS 66.42 0.000 100.00 
11 | EPTDL 14.005 2.065 40.933 
12 EU 1105.0 139.1 5167.0 
13 FFEC 65.831 7.685 99.859 
14 ICP 15.134 -10.068 951.962 
15 IEWPP | 1.540е+09 | 4.100e+06 | 3.035e+10 
16 MEES 48821 170 677000 
17 REC 32.8303 0.1804 92.7479 
18 RIFRT 867.499 0.682 5661.000 


To analyze principal components (PCA) and clusters, it 
is necessary to standardize the data according to the scale z 
(or z-score). Thus, the data were normalized (mean=0) 
with the unit of the new scale being a standard deviation, 
and the variance contained in each component was 
measured by eigenvalues, as shown in Table 3. 


Table 3: Variance retained in each of the components 


Component Eigenvalues тиш Cumulative 
(%) Variance (%) 
7 0.984 5.466 78.845 
0.817 4.540 83.385 
0.684 3.802 87.186 
10 0.601 3.341 90.528 
11 0.473 2.627 93.154 
12 0.403 2.238 95.392 
13 0.294 1.631 97.023 
14 0.199 1.107 98.130 
15 0.176 0.980 99.110 
16 0.119 0.659 99.769 
17 0.023 0.126 99.895 
18 0.019 0.105 100.000 


Cumulative 
Variance (46) 


Variance 


Component (46) 


Eigenvalues 


6 1.117 


6.203 73.379 


Components with eigenvalues greater than 1 (one) 
indicate that the variance of the component is greater than 
that, which would represent the variance of the original 
data. According to Battisti and Smolski (2019), this may 
be a criterion for deciding on the number of components to 
be used. The first 5 components correspond to almost 70% 
of the explanation of the variance of the data set (as 
highlighted in Table 3) and are therefore maintained for 
analysis purposes. 


Besides Table 3 with the eigenvalues, it is possible to 
evaluate the correlation between the selected variables and 
each of the 5 components identified. Table 4 presents the 
measure of this correlation. 


Table 4: Correlation between the selected variables and the 5 components obtained 


N° Var Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 
1 ANE 0.253 0.150 0.054 
2 APD -0.026 
3 ATE 0.039 
4 CPI 
5 EIN 
6 EPC 
7 EPFNS 
8 EPFOGCS 
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N? Var Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 
9 EPFOS 0.155 -0.110 0.393 
10 EPFRS 0.246 0.217 EL NN 0.413 | 0.136 
11 EPTDL -0.291 0.099 -0.193 -0.140 939 
12 ЕП 0.023 -0.192 0.114 
13 FFEC 0.052 -0.092 
14 ICP -0.021 0.345 0.050 0.147 
15 IEWPP 0.238 | 0.426 o oae 0.393 0.001 
16 MEES 0.598 | 0.182 ЕСИ -0.113 0.245 
17 КЕС 0.345 0.099 0.058 0.169 
18 КІЕКТ 0.338 -0.194 -0.004 


It is observed that 9 variables APD, ATE, EPFNS, 
EPFNS, EPFOS, REC, FFEC, EPC, EU, and MEES) 
showed a high correlation with Component 1. And ATE, 
EPFNS, EPC, EU, FFEC, and MEES (highlighted in 
green) are positively correlated (directly proportional), and 
APD, EPENS, EPFOS, and REC (highlighted in red) are 
negatively correlated (inversely proportional). Thus 
Component 1 is directly related to access to electricity and 
non-renewable energy sources and is inversely 
proportional to energy consumption from renewable 
sources. 


Similarly, Component 2 showed a positive correlation 
with the variables ANE, ICP, IEWPP, MEES, REC, 
RIFRT and a negative correlation with the variables 
EPFOGCS, EPFOS, and FFEC. Thus, component 2 is 
positively related to renewable energy sources and 
negatively related to non-renewable and fossil fuel-derived 
energies. 


Concerning Component 3, this correlates positively 
with methane emission (MEES) and alternative energy 
sources other than hydroelectric (EPFRS). On the other 
hand, it correlates negatively with alternative nuclear 
energies (ANE). 


In short, components 1 and 2 are responsible for 
explaining more than 40.8% of the total variance 
according to the data, especially Component 1 as shown in 
Figure 1. 
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Scree plot 


Percentage of explained variances 


Dimensions 


Fig 1: Percentage (%) of the variance of each component. 


As shown in Table 4 and as shown in Figure 2, it can 
be seen which of the variables initially chosen for this 
study are the ones that most contribute to explaining the 
variance of Components | and 2. 
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Contribution of variables to Dim-1-2 


Contributions (96) 
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Thus, the cutoff was defined as the 10 variables (REC, 
EPC, FFEC, EU, EPOGCS, ATE, RIFRT, ANE, MEES, 
and EPFOS) that most contributed to the explanation of 
variance. The remaining 8 variables (APD, EPFNS, 
IEWPP, ICP, EPTDL, CPI, and EIN) were discarded 
because they contributed less than 1/5 of the total variance. 


After excluding the NA, the countries’ data were 
grouped in 1997, 2002, 2007, and 2012. These years and 
countries were selected, considering that they had all the 


& g d © ? € Ф © X $ E #& § 8 d Ф T г 
& $ K e a € ¥ € £ o & 8 5 


3 


data available. 


Fig. 2: Variables that most contribute to Components 1 


and 2. 


Table 5: Countries grouped by year 


Year Countries N° 
1997 Bangladesh, Bolivia, Brazil, Kazakhstan, Colombia, Philippines, Guatemala, India, Indonesia, 7 
Morocco, Mexico, Peru, Russia, Senegal, Tanzania, Vietnam, and Zambia 
Albania, Armenia, Azerbaijan, Bolivia, Brazil, Cameroon, Cambodia, China, Colombia, 
2002 Philippines, Guatemala, India, Indonesia, Mexico, Nicaragua, Nigeria, Peru, Dominican 24 
Republic, Russia, Sri Lanka, Tajikistan, Tunisia, Ukraine, and Vietnam. 
Armenia, Bangladesh, Belarus, Bolivia, Brazil, Bulgaria, Cambodia, China, Colombia, Egypt, 
2007 Philippines, Ghana, Georgia, Guatemala, India, Indonesia, Iran, Iraq, Jordan, Macedonia, Mexico, 31 
Moldova, Nicaragua, Nigeria, Pakistan, Peru, Romania, Russia, Sri Lanka, Ukraine, and Vietnam 
South Africa, Albania, Algeria, Bangladesh, Bosnia and Herzegovina, Brazil, Bulgaria, China, 
Colombia, Cote d’Ivoire, Costa Rica, Egypt, Philippines, Gabon, Ghana, Guatemala, India, 
2012 Indonesia, Jordan, Malaysia, Morocco, Mexico, Mongolia, Nepal, Nicaragua, Pakistan, Peru, 37 
Kenya, Dominican Republic, Romania, Russia, Sri Lanka, Thailand, Turkey, Ukraine, Vietnam, 
and Zambia. 
After defining the sets of countries to be considered for pen ыш аа 
cluster analysis, it was determined how these countries = 
were grouped according to their similarities and 
differences in each of the years considered. As for 1997, " 
the countries were grouped into 2 clusters, as shown in the 8 
graph presented in Figure 3. E 
40 
= 
20 
1 2 3 4 5 6 8 9 10 
Number of clusters k 
Fig. 3: Optimal number of clusters for 1997 
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According to Components 1 and 2, the countries were 
grouped as shown in Figure 4. 


Cluster Plot: Ano 1997 


cluster 


y 1 > 
< 1 
\ * 


От (47 6%; 


Fig 4: Clusters obtained for 1997 


The countries belonging to Cluster 1 are Bangladesh, 
Bolivia, Guatemala, Senegal, Tanzania, and Zambia. They 
stood out negatively for having the worst levels of access 
to electricity (ATE) and positively for having the highest 
levels of renewable energy consumption. They also have 
the lowest levels of energy consumption from fossil fuels. 


Cluster 2 includes Brazil (country # 3 in Figure 8), 
Colombia, Philippines, Indonesia, India, Kazakhstan, 
Morocco, Mexico, Peru, Russia, and Vietnam. Brazil stood 
out as the holder of the largest number of renewable waters 
(RIFRT) among all 17 countries. 


Itis also noteworthy in 1997 that, except for Russia, all 
presented energy use (EU) and electricity consumption 
(EPC) below the average of the countries present in this 
study. The full results for 1997 are presented in the 
Annexes section. 


In 2002, the existence of 3 clusters was verified, as 
shown in Figure 5. 


Optimal number of clusters 
150 ] 


n Sum of Square 
© 
е 


Total Withi 


1 2 3 4 5 6 7 8 9 10 
Number of clusters k 


Fig. 5: Optimal number of clusters for 2002 
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The following countries belong to Cluster 1: 
Azerbaijan, Bolivia, China, Colombia, Dominican 
Republic, Indonesia, India, Mexico, Philippines, Russia, 
Tunisia, Ukraine, and Vietnam. These countries stand out 
from the others, mainly concerning the higher fossil fuel 
energy consumption (FFEC), electricity production from 
oil, gas, and coal (EPOGCS), and the percentage of their 
populations with access to electricity (ATE). 


In 2002, once again, Russia stood out as the only 
country with per capita electricity consumption (EPC) and 
energy use (EU) above average among all the other 
countries in this study. 


Cluster 2 includes Cameroon, Guatemala, Cambodia, 
Sri Lanka, Nigeria, and Nicaragua. The countries in this 
cluster, except for Cameroon, presented higher energy 
production values from oil, gas, and coal (EPOGCS), 
however, they presented the best numbers in relation to 
energy consumption from renewable sources. Also in this 
cluster, Cameroon, Cambodia, and Nigeria had the lowest 
fossil fuel consumption figures (FFEC). 


Cluster 3 includes Albania, Armenia, Brazil (country # 
5 in Figure 10), Peru, and Tajikistan. These countries 
presented the lowest results for energy production from 
oil, gas, and coal (EPOGCS). They also stood out in the 
use of alternative and nuclear-derived energies (ANE). The 
complete results for 2002 are also presented in the 
Annexes section. 


The visualization of the clusters in the space defined by 
Components | and 2 for 2002 is in Figure 6. 


Cluster Plot: Ano 2002 


Dim: 
<__ 
= 


От (43.4%) 


Fig. 6: Clusters obtained for 2002 


In 2007, there are 2 (two) clusters of countries, as 
indicated in Figure 7. 
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Optimal number of clusters 


1251 


1004 


Total Within Sum of Square 
E 
o 


a 
© 


1 2 3 4 5 6 
Number of clusters k 


Fig 7: Optimal number of clusters for 2007 


The countries belonging to Cluster 1 are Bangladesh, 
Brazil (country # 5 in Figure 12), Colombia, Georgia, 
Ghana, Guatemala, Indonesia, India, Cambodia, Sri Lanka, 
Nigeria, Nicaragua, Pakistan, Peru, Philippines, and 
Vietnam. In this cluster, it is noteworthy that all countries 
presented below-average indices for electricity 
consumption (EPC), energy use (EU), and fossil fuel 
consumption (FFEC). And values above the average for 
renewable energy consumption (REC). And to cluster 2 
Armenia, Bulgaria, Belarus, Bolivia, China, Egypt, Iran, 
Iraq, Jordan, Moldova, Mexico, Macedonia, Romania, 
Russia, and Ukraine. This cluster of countries presented 
below-average values for renewable energy consumption 
(REC) and electricity production. from oil sources 
(EPFOS). For the variables fossil fuel energy consumption 
(FFEC), energy production from oil, gas, and coal 
(EPOGCS), and access to electricity (ATE) above average. 
This year's full results are presented in the Annexes 
section, and clusters can be viewed in Figure 8. 


Cluster Plot: Ano 2007 


222! 
а 16 
E 
7 ^ 
a 28 5 


cluster 


ПЕ 
[4] 2 


Оіт2 (19.3%) 


Dim! (44 6%) 


Fig. 8: Clusters obtained for 2007 
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For 2012, 2 clusters were also observed, as shown 
in Figure 9. 


Optimal number of clusters 


1001 


Total Within Sum of Square 


+ 
1 2 3 B 5 6 7 8 9 10 
Number of clusters k 


Fig. 9: Optimal number of clusters for 2012 


The countries belonging to Cluster 1 are Bangladesh, 
Bulgaria, Bosnia and Herzegovina, China, the Dominican 
Republic, Algeria, Egypt, Indonesia, India, Jordan, 
Morocco, Mexico, Mongolia, Malaysia, Peru, Philippines, 
Romania, Russia, Thailand, Turkey, Ukraine, Vietnam, 
and South Africa. These countries showed low energy use 
generated from renewable sources (REC). 


Cluster 2 includes Albania, Brazil (country # 5 in 
Figure 14), Cote d'Ivoire, Colombia, Costa Rica, Gabon, 
Ghana, Guatemala, Kenya, Sri Lanka, Nicaragua, Nepal, 
Pakistan, and Zambia. In this cluster of countries, it is 
observed that the variables per capita electricity 
consumption (EPC), fossil fuel energy consumption 
(FFEC), energy use (EU), and electricity generated from 
oil, gas, and coal (EPOGCS) below average and inverse 
behavior concerning energy consumption from renewable 
sources (REC). 


The 2 (two) clusters showed similar behavior in 
relation to energy production from oil-related sources 
(EPFOS) and levels close to the average in relation to 
methane emissions (MEES), the exceptions to methane 
emissions were China  (3.8215409) апа Russia 
(3.7677935), which presented the highest emission levels. 


The full results for 2012 are presented in the Appendix 
section, and the clusters can be viewed in Figure 10. 
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Cluster Plot: Ano 2012 


ті (42 3%) 


Fig 10: Clusters obtained for 2012 
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Ш. ANALYSIS OF RESULTS 


The results of Brazil within the clusters over time were 
summarized in Chart 1. Its position was classified 
according to 3 (three) criteria: high (when grouped with 
the countries that presented the highest results), 
intermediate (grouped with the countries that presented 
intermediate values), and low (grouped with the countries 
that presented the lowest values) for each of the variables. 
Colors are a qualitative indication of the result. That is, 
green means that the numbers are relatively good, and red 
is considered bad. For example, the low level of methane 
emission was highlighted in green. 


Table 6: Cluster in which Brazil is for each selected variable and in each year of analysis 


Year 1997 


Alternative and Nuclear 
Energy (% of total energy 
used) (ANE) 


Access to electricity (% of 
population) (ATE) 


Electricity Consumption (kWh 
per capita) (EPC) 


Electricity production from oil 
sources (% of total) (EPFOS) 


Electricity production from 
oil, gas, and coal sources (% of 
total) (EPOGCS) 


Energy use (kg of oil 
equivalent per capita) (EU) 


Fossil fuel energy consumption 
(% of total) (FFEC) 


Methane emission in the 
energy sector (thousand metric 
tons of CO2 equivalent) 
(MEES) 


Renewable energy 
consumption (% of total final 
consumption) (REC) 


Renewable internal freshwater 
resources, total (billions of 
cubic meters) (RIFRT) 


It is observed that Brazil was similar to countries that 
presented a reduction in fossil fuel energy consumption 
(FFEC) and those that increased renewable energy 
consumption (REC). As for electricity production from oil, 
gas, and coal sources (EPOGCS), it always remained 
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2002 2007 2012 


Intermediate 


Intermediate 


Intermediate 


among the 5 (five) countries with the lowest values. 
Regarding energy use (EU), it was always grouped with 
the countries that presented the lowest consumption levels. 
And concerning methane emission in the energy sector 
(MEES), it always remained at low levels. As for 
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renewable internal freshwater resources (RIFRT), despite 
recurring water crises, it is verified that, historically, 
Brazil's position is privileged in relation to other countries 
in this regard. 


In 2007, there was a change in the cluster in the use of 
alternative and nuclear energy (ANE), electricity 
production from oil sources (EPFOS), access to electricity 
(ATE), and electricity consumption (EPC). That is, it was 
grouped with countries that presented numbers considered 
poor. And this trend in the cluster remained for the ATE 
and EPC variables in 2012. In this context, more data 
would be appropriate to verify whether this trend has 
continued to the present day and how Brazil has advanced 
in the context of its energy matrix. 


IV. CONCLUSIONS 


As noted by Tolmasquim, Guerreiro, and Gorini 
(2007), the importance of the Brazilian energy sector 
cannot dispense with a continuous, systematized, and 
dynamic knowledge process in the face of the challenges 
of creating conditions for a rapid expansion of its supply 
that conditions the enhancement of productive investments 
on demand. 


As suggestions for future studies aimed at deepening 
the results obtained, it is suggested to deepen the 
evaluation of the scales of the variables used and the 
application of other possible clustering techniques to 
obtain even more robust elements that support the rationale 
and plausibility of the proposed model as recommended by 
Lopes and Gosling (2021). 


It is noteworthy that the objective of using quantitative 
variables to determine potential constituent elements of an 
energy matrix was achieved. Nevertheless, from the results 
obtained with clustering, it was possible to better 
understand the evolution of the status and repercussion of 
the Brazilian energy matrix compared to the other 
countries included in the analysis. It should be noted that 
the main limitation of the study focuses on the fact that the 
database used does not have complete data (NA) to obtain 
an even more complete and in-depth temporal analysis 
since it imposes limitations in the definition of the 
conditions found. 


REFERENCES 
[1] [1] Battisti, I. D. E, & Smolski, F. M. da S. (2019). 
Software R: curso ауапсаао. 


https://smolski.github.io/livroavancado/ 

[2] [2] Jain, A. K. (2010). Data clustering: 50 years beyond K- 
means. Pattern Recognition Letters, 31(8), 651—666. 
https://doi.org/10.1016/j.patrec.2009.09.011 


www.ijaers.com 


International Journal of Advanced Engineering Research and Science, 9(12)-2022 


[3] [3] Jolliffe, I. (2005). Principal Component Analysis. Em В. 
S. Everitt & D. C. Howell (Orgs.), Encyclopedia of Statistics 
in Behavioral Science (p. bsa501). John Wiley & Sons, Ltd. 
https://doi.org/10.1002/0470013192.bsa501 

[4] [4] Kassambara, A., & Mundt, F. (2020). factoextra: Extract 
and Visualize the Results of Multivariate Data Analyses. 

[5] [5] Kaufman, L., & Rousseeuw, Р. J. (2005). Finding 
groups in data: An introduction to cluster analysis. Wiley. 

[6] [6] KETCHEN Jr., D. J., & Shook, C. L. (1996). THE 
APPLICATION OF CLUSTER ANALYSIS IN 
STRATEGIC MANAGEMENT RESEARCH: АМ 
ANALYSIS AND CRITIQUE. Strategic Management 
Journal, 17(6), 441—458. 
https://doi.org/10.1002/(SICT) 1097- 
0266(199606)17:6«441:: AID-SMJ819>3.0.CO;2-G 

[7] [7] Lopes, H. E. G., & Gosling, M. de S. (2021). Cluster 
Analysis in Practice: Dealing with Outliers in Managerial 
Research. Revista de Administração Contemporánea, 25(1), 
e200081. https://doi.org/10.1590/1982-7849rac202 120008 1 

[8] [8] Silva, К. S. S. da, Dias Junior, C. M., Leite, A. L. da S., 
& Cario, S. A. F. (2022). A Representation Model of 
Brazilian Energy Matrix. International Journal of 
Development Research, 12(10), 9. 
https://doi.org/10.37118/1jdr.25603.10.2022 

[9] [9] Sun, L., Chen, G., Xiong, H., & Guo, C. (2017). Cluster 
Analysis in Data-Driven Management and Decisions. 
Journal of Management Science and Engineering, 2(4), 
227-251. https://doi.org/10.3724/SP J.1383.204011 

[10] [10] The World Bank. (2020). The World Bank | Data 
[Institucional]. 
https://data. worldbank.org/indicator 

[11] [11] Tolmasquim, М. T., Guerreiro, A., & Gorini, К. (2007). 
Matriz energética brasileira: Uma prospectiva. Novos 
Estudos - CEBRAP, 79, 47-69. 
https://doi.org/10.1590/S0101-33002007000300003 

[12] [12] Wickham, H., Averick, M., Bryan, J., Chang, W., 
McGowan, L., Francois, R., Grolemund, G., Hayes, A., 
Henry, L., Hester, Ј., Kuhn, M., Pedersen, T., Miller, E., 
Bache, S., Müller, K., Ooms, J., Robinson, D., Seidel, D., 
Spinu, V., ... Yutani, H. (2019). Welcome to the Tidyverse. 
Journal of Open Source Software, 4(43), 1686. 
https://doi.org/10.21105/joss.01686 


Indicators. 


Page | 26 


Silva and Dias Junior. International Journal of Advanced Engineering Research and Science, 9(12)-2022 


APPENDIX 


Table 7: Results of clusters of countries in 1997 


N° Country Year REC EPC FFEC EU EPOGCS ATE RIFRT ANE MEES EPFOS Cluster 


1 Bangladesh 1997 0.9581 0.7022 0.3287 0.7399 1.0105 -0.1337 0.6095 -0.1293  -0.4814 
2 Guatemala 1997 1.1317 0.6666 0.9299 0.6109 -0.8223 0.4198 -0.1324 0.3775 -0.1274 0.2575 
3 Senegal 1997 0.6552 0.6946 0.6406 0.7126 re 56 56 -0.1586 0.6392 -0.1392 


4 Tanzania 1997 -0.1403 -0.1100 -0.0540 


5 Zambia 1997 -0.1415 0.2084 -0.1302 -0.7803 


6 Bolivia 1997 -0.1806 -0.1321 -0.7551 
7 Brazil 1997 0.4290 io ci оз " di дз 15826 -0.0470 — -0.6869 
8 Colombia 1997 aie he ere 0.3039 R -0.9233 0.4181 0.5071 0.1502 -0.0914 -0.7724 
9 Indonesia 1997 0.5385 Dic itd rod 0.8844 OT 0.4675 feos 0.1775 -0.0780 
10 India 1997 0.7144 | з Dict бос 4; 060 ОЬ 0.2875 0391 0.2887 -0.6471 
11 Kazakhstan 1997 Е US 1.0101 0.0632 0.8208 0.6109 -0.1465 КЕ -0.0682 -0.5521 
12 Morocco 1997 TOUR Т AT 0.6539 е 0.7374 m -0.1576 DRITTE -0.1259 0.4491 
13 Mexico 1997 USD TUS 0.6586 ос 0.4167 0.5111 -0.0382 d 0.0682 0.7854 
14 Peru 1997 0.0388 DIS 0.0665 QOO 1.0052 1o 0.3488 0.2220 -0.1315 -0.0076 
15 Philippines 1997 0.1059 | Я (юй PETS 0.2034 mn 66 091€ n -0.1195 0.7950 
16 коа 1997 ioo. 03840 [08363 0.5556 02308 ОСИ -0.5984 
17 Vietnam 1997 0.9848 | Us T m " 2793 -0.6073 mU m Aenea -0.0964 -0.4169 
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Table 8: Results of clusters of countries in 2002 


N° Country Year REC EPC  FFEC EU EPOGCS ATE RIFRT ANE МЕЕ$ EPFOS Cluster 


1 Azerbaijan 2002 Ооа 9.9739 10083 (9; 08708 0.6357 -0.1641 9. -0.0528 0.2309 
2 Bolivia 2002 mre б. p. 04683 4 e -0.4013 UE вз 00713 оу -0.1328 -0.7529 
3 China 2002 me "m. JL 04795 9 б 0.6250 0.5504 0.7169 ne 1.8745 -0.6918 
4 Colombia 2002 acer m д 0274 ee -1.0665 0.4639 0.5071 0.2782 -0.0815 -0.7863 


Dominican - - - - 


711 92 .287 -0.1 -0.1 p 

5 Republic 00 0.4089 0.4097 б 3 0.5050 MU E 015933 0.5132 E Б 

j ; Й b j | ; ; Ў -0.0214 
6 Indonesia 2002 0.4594 0.6275 0.1326 0.5351 0.7484 0.2148 0.4675 0.5710 0.1056 -0.0 
7 India 2002 0.6597 0.7402 0.2875 0.3350 -0.6419 
8 Mexico 2002 0.7198 0.6136 0.5643 -0.0382 0.0451 0.3437 
9 Philippines 2002 0.0743 0.1383 -0.0162 -0.1146 -0.3639 

Russian - 
10 : 2002 0.4665 0.7930 0.6276 0.1686 0.6357 1.1878 0.1201 2.8874 -0.6913 
Federation 0.8821 

11 Tunisia 2002 0.5210 0.4823 0.6722 0.5248 1.1659 0.5611 -0.1654 0.6305 -0.1304 -0.4592 
12 Ukraine 2002 E т 0.5808 0.1415 -0.2995 0.6357 -0.1494 0.9088 0.3066 -0.7777 


13 Vietnam 2002 0.7133 -0.0538 -0.0737 -0.3880 


14 Cameroon 2002 1.7784 -0.0809 -0.1264 -0.6749 


15 Guatemala 2002 1.0840 0.6392 0.7314 0.5891 -0.0414 0.0884 -0.1324 0.4750 -0.1230 0.6109 


-0.1288 -0.1325 2.3770 


16 Cambodia 2002 1.7086 1.0678 


17 SriLanka 2002 1.0275 0.6531 0.7220 0.6439 0.0634 0.0194 -0.1501 0.4333 -0.1328 1.2508 


о 

a 1 

[ony 

со 
© © = 
lon o2 lon 
=) 1 С 1 N 1 
E 66 o 
o MJ о 
© = 
Q2 © 
[V 1 J 1 
= A 
ч N 
© © o 
en 1 КЭ 1 R L 
o0 Ne) a 
сл © оо 
© © 
UA 1 FE 1 
— шә 
N © 
© © © 
bo bo dus 
oy n na! G 1! 
N CA © 
— оо © 


18 Nigeria 2002 1.7597 0.0624 -0.0973 0.1623 -0.7932 
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N° Country Year REC EPC FFEC EU  EPOGCS ATE RIFRT ANE  MEES EPFOS Cluster 
19 Nicaragua 2002 0.7854 ME У Т dos 0.4645 ORO] -0.1176 05315 -0.1374 1.7005 2 
20 Albania 2002 0.1741 0026 ТЕ oe 0.6357 -0.1582 1.1671 -0.1412 -0.5933 3 
21 Armenia 2002 Кое T non бүз E -0.9157 0.5678 -0.1645 3.0263 -0.1384 -0.7932 3 
22 Brazil 2002 0.5221 1.6116 0.6947 -0.0312 -0.6646 3 
23 Реги 2002 0.3145 ТР 0.0268 CIE -1.2556 259 0.3488 0.5645 -0.1305 -0.5109 3 
24 Tajikistan 2002 1.0941 -— SIDE 7 тсе -0.1467 -0.1407 -0.7932 3 
Table 9: Results of clusters of countries in 2007 

N° Country Year REC EPC  FFEC EU EPOGCS ATE RIFRT ANE MEES EPFOS Cluster 
1 Bangladesh 2007 0.5524 9 2 d QS 
2 Brazil 2007 0.5466 UE ^ ol 
3 Colombia 2007 Uo a e dod g 02566 
4 Georgia 2007 0.1847 ОО jj 01346 
5 Ghana 2007 07809 5 es T EE 
6 Guatemala 2007 1.0687 Nos Т 
7 Indonesia 2007 0.3082 om фк 
8 India 2007 04674 | F g 00425 
9 Cambodia 2007 12640 4. a 7 TIUS 

| 
10 SriLanka 2007 09655 9 oa Tb 
11 Nigeria 2007 | 1.8490 2 
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N° Country Year REC EPC  FFEC EU  EPOGCS ATE RIFRT ANE MEES EPFOS Cluster 


0.6898 0.5381 1.0570 0.5977 
12 Nicaragua 2007 0.7269 (у eS @ 5716 буза 0.3345 ТР -0.1176 TUS -0.1371 | 1.5547 
13 Pakistan 2007 04656 (у eir doen à $07 0.2096 5:58 -0.1494 eT -0.0618 0.2720 
14 Peru 2007 0.1791 б ДЬ 0.1236 QOIS -0.7778 0.0243 0.3488 0.4272 -0.1278 -0.6943 
15 Philippines 2007 0.0214 m meee " a 0.2575 0.0268 -0.0162 бой -0.1138 -0.5076 
16 Vietnam 2007 0.3773 m M nm. Sup 0.1722 0.4066 -0.0538 052 -0.0420 -0.6621 
17 Armenia 2007 Td WU 01301 9 dod -1.0149 0.6096 -0.1645 | 2.1647 -0.1373 -0.7932 2 
18 Bulgaria 2007 UC 0.2917 0.3653 0.0930 -0.0173 0.6357 -0.1601 1.3774 -0.1340 -0.7494 2 
19 Belarus 2007 9578 0.0388 0.8451 0.1760 1.1839 0.6357 -0.1560 T -0.1339 -0.7674 9 
20 Bolivia 2007 Ts nm. EDU 05073 9 a -0.0403 т -0.0713 v jọ 0.1283 | -0.7349 2 
21 China 2007 uut О 0.6832 T q 0.6834 05920 0.7169 T" T T -0.7649 9 
22 Egypt 2007 m T An 09665 n 0.8042 0.5850 -0.1664 UT -0.0610 -0.3812 2 
23 Iran 2007 hoes Tm 1.0602 0.0965 0.9272 0.5936 -0.1263 EO 0.3517 -0.1360 2 
24 Iraq 2007 ИО СОО 1.0035 nen -0.1140 0.5712 -0.1556 m -0.1351 -0.2351 p 
25 Jordan 2007 ОТТ E n ^n ee 1.1725 0.5950 -0.1665 TU -0.1405 -0.2304 à 
26 Moldova 2007 б ue c EEG ОШ 1.0011 0.6357 -0.1662 ee дз 01376 -0.7904 2 
27 Mexico 2007 cac DUE 0.7322 T 0.6580 0.5716 -0.0382 hoe до 90867 -0.1236 2 
SURE 2007 3 0.0985 0.6061 Е 0.7310 0.6357 -0.1650 7, 0.1306 -0.5494 2 
Macedonia 0.4834 0.2978 0.3093 
29 Romania 2007 д В 0.5444 _ 0.0555 0.6357 -0.1534 0.1055 -0.0583 -0.7344 9 
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N° Country Year REC  EPC  FFEC EU EPOGCS ATE RIFRT ANE MEES EPFOS Cluster 


0.4022 0.1546 0.1581 
39 Rus 20097 — - 0.6863 07863 0.7666 0.1990 0.6357 11878 0.1486 0.7370 2 
Federation 0.8752 
31 Ukraine 2007 aon 0.0691 0.4855 0.2018 -0.3606 0.6289 -0.1494 1.1587 0.2354  -0.7824 2 
Table 10: Results of clusters of countries in 2012 
Y EP RIFR MEE  EPFO  Clust 
B бошу ea REC EPC FFEC EU ОСС TATE ANE had 
° г S T S S r 
0.263 Е 0.200 а б = б = 
1 Bangladesh 2012 3 0.657 0 0.713 1.1421 0.538 0.1337 0.622 0.1235 -0.3853 
5 p 3 5 
И 0.345 0.183 0.042 0.635 = 1.831 - 
2 Bulgaria 2012 ps 1 7 7 -0.1481 7 0.1601 2 0.1328 -0.7776 
Bosnia and = = = 
| 0.057 0.832 0.635 = 2 
3  Herzegovin 2012 0.497 9 2 0.174 0.3067 7 0.1555 0.208 0.1371 -0.7866 
a 2 6 9 
4 China 2012 0619 007^ 0916 Som озо 5994 лә 0227 -0.7859 
0 4 5 
0 2 2 
Dominican , E 0.598 à 0.563 = Ё - 
| ; i і 4 8621 
5 Republic 2012 0.454 0.385 а 0.516 0.7624 à 0.1593 Oo 
4 3 9 5 
6 Algeria 2012 | 0.988 0.443 1o 0330 MEE RA j 0.630 | 0.0336 | -0.7322 
8 8 0.1631 
8 8 4 9 
S 8 0.992 Ё 0.625 я Е И 
7 Egypt 2012 | 0.822 0.343 0 0.487 0.9275 с 0.1664 P 9229 
4 6 2 0 
| А Ё { 3 0.500 n 
8 Indonesia 2012 0.057 0.556 0.040 0.504 0.8577 б 0.4675 0.559 0.0653 -0.2986 
4 7 9 5 7 
9 India 2012 ice 0.559 2 0.588 0.6375 0.046 0.2875 0.384 0.4589  -0.7258 
0 3 5 9 
3 j 1.008 5 0.618 = 5 = 
10 Jordan 2012 | 0.898 0.291 B 0.465 1.1769 d — олло M 
8 6 9 1 
11 Morocco 2012 0.638 0.524 ©“ osoo D ^" К 0.529 р 0.0437 
9 1 0.1576 0.1112 
7 6 6 7 
12 Mexico 2012 0702 0.216 = 0.246 ово P E 0.359 0.0695 -0.1886 
9 6 0.0382 
8 4 9 5 
0.773 0.077 2 Е 
13 Mongolia 2012 p 5 Ы 1.1893 7 -0.6195 
j 0.98 0365 9 0.228 7 0.1557 9,639 0.1345 
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MEE  EPF 
Country V DP prec ЕС FEEG ки LESS arm КЕЁ мв DEC 
а г S T S S r 
0 4 8 2 
14 Malaysia 2012 0.912 0.250 ML 007 SEO 0015s ОЯ $ -0.6452. 
3 5 1 9 0.0162 
6 0 
0.008 б 0.330 Ў 0.333 0.183 z 
15 Peru 2012 | 0.443 | 0.557 -0.4398 7 0.3488 P 0.1253 BER 
8 3 
о 3 г к 0.197 3 li = 
16 Philippines 2012 0.058 0.572 0.233 0.639 0.3498 0.304 -0.6003 
2 0.0162 0.1163 
7 9 2 0 4 
a ў 0.322 Й 0.635 = 0.498 2 
17 Romania 2012 0.293 0.137 8 0.211 -0.1348 7 0.1534 8 0.0689 -0.7509 
0 9 6 
18 Russian 2012 | 0.889 0.760 0.796 0.917 02311 0.635 11878 0.130 -0.7065 
Federation 0 4 7 3 T 2 
Ў i 0.445 ч 0.605 8 В Е 
19 Thailand 2012 0236 0.173 4 0.172 0.9419 И SUE 0.527 0.0071 BEBE 
0 9 6 5 
К Н 0.748 E 0.635 z б 2 
20 Turkey 2012 0.570 0.101 5 0.263 0.3848 " 0054 09? 94 Ж 
5 0 7 5 
| 9 0.094 0.415 0.099 0.631 2 1.360 
21 Ukraine 2012 pe | _ % -0.3176 » mide 3 0.1686 -0.7843 
0.245 if 0.012 à 0.564 - 0.045 = 
22 Vietnam 2012 о р Е о 0.1382 | 5:8 5 0.0191 ЙИ 
South Е 0.256 0.674 0.083 0.136 = и 2 
i f 402 -0. 
2d Africa 2012 5 9 3 Lungo 8 0.1526 ? 3 0.1119 MEME 
: 0.309 = 1.920 Е 
24 Albania 2012 3 0.1582 7 0.1411 -0.7932 
| 0.423 0.634 - 
25 Brazil 2012 4 1.6116 3 0.0442 96760 
Cote 1.440 - Ё - 
At d'Ivoire 2012 1 0.1425 rd 0.1353 moss 
| д 0.427 - 
27 Colombia 2012 0.053 0.5071 5 0040; ИШ 
6 
| 0.244 z 0.847 : 
28  CostaRica 2012 7 0.1312 9 0.1399 -0.5234 
1.462 2 й z 
4 -0. 
29 Gabon 2012 6 0.1152 0 "d 0.1406 0.3394 
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30 Ghana 


31 Guatemala 2012 


32 Kenya 2012 


33 Sri Lanka 2012 


34 Nicaragua 2012 


35 Nepal 2012 


36 Pakistan 2012 


37 Zambia 2012 
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